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© Recombinant DNA technology is used to produce the 
transforming growth factor-o. (TGF-«I precursor and its 
fragments. TGF-« precursor and its fragments, including 
particularly mature TGF-a, and optionally in combination 
with human transforming growth factor-B, has utility for 
example in the therapeutic treatment of human beings for 
*"j bone diseases and to accelerate wound healing. In addition, 
^ TGF-a is useful es an edjuvsnt for cell culture so as to reduce 
requirements for serum In the culture media {thereby 
JOT resulting in purification advantages), and to stimulate or 
W enhance cell growth in cell culture. Also, preparation of large 
T quantities of the TGF-a precursor and its fragments enables 
. the preparation of reagents for the assay of the TGF-ot 
S precursor and its fragments in body fluids for the diagnosis 
~ of neoplastic or other diseases. 

O 

0. 
UI 



Croydon Printing Company Ltd. 



1 



100/196,279 

P154434 



HUMAN TRANS FORMING GROWTH FACTOR AND PRECURSOR 
OR FRAGMENT THEREOF, CELLS , DNA, VECTORS AND 
METHODS FOR THEIR PRODUCTION, COMPOSITIONS 
AND PRODUCTS CONTAINING THEM, AN D RELATED 
ANTIBODIES AND DIAGNOSTIC METHODS 



The present invention relates to human precursor trans- 
forming growth factor-a and its fragments, notably mature human 
transforming growth factor-a (TGF-a), corresponding to that found in 
human tissues and to novel forms and compositions thereof and 
methods for production to homogeneity in therapeutically and/or 
diagnostically significant quantities. 

The present invention enables the production of sufficient 
quantities of high purity material in comparison to the isolation 
methods previously employed involving production and extraction from 
existing cell cultures, which naturally include undesired proteins 
and which are only available in extremely small quantities. 

The publications and other materials cited herein are 
incorporated herein by reference and, for convenience, generally are 
numerically referenced in the following text and respectively 
grouped 1 . "iji the appended bibliography. 
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Transforming growth factors (TGFs) are factors which can 
elicit a phenotypical transformation of normal cells in a reversible 
way. It has been shown that administration of TGFs apparently 
stimulates normal cells to undergo uncontrolled growth of the cells 

5 and promotes anchorage independence as measured by formation of 

transformed cell colonies in soft agar (1-3). Two classes of TGFs 
have been distinguished. TGF~ a is secreted by a wide variety of 
tumor cells from human or rodent origin (4-7). TGF-a and epidermal 
growth factor (EGF) are reported to compete for the same receptor 

1C (5, 8, 9), which is phosphorylated at tyrosine residues after the 
binding of TGF-a or EGF (10-11). Some evidence has been presented 
for the presence of another receptor, specific for TGF-a (12). The 
anchorage independent growth triggered by TGF-a is strongly 
potentiated by TGF-p (13, 14). This latter TGF has been detected in 

15 many normal and tumor cells (13-17) and has been purified from 
kidneys (18), placenta (19) and platelets (20). TGF-p is not 
believed to bind to the EGF receptor and is believed to require EGF 
or TGF-a for its transforming activity or NRK cells (13-17). 

20 The biological role of the TGFs has not been clearly 

elucidated. Many studies suggest that TGFs may play an important 
role in the transformation event. It has been shown that cellular 
transformation with retroviruses (21-24), SV40 (25) or polyoma virus 
(26) results in the secretion of TGF- a . The tight linkage of TGF 

25 secretion to cellular transformation has been indicated by 

transfection experiments with polyoma virus DMA, which show that 
introduction of the DNA segment for middle T antigen is needed and 
sufficient to trigger both the transformed "phenotype and the TGF 
secretion (26). Transformation studies with a temperature-sensitive 

? : mutant of Kirsten murine sarcoma virus also indicate that TGF-a is 
secreted only when phenotypic transformation occurs- at the 
permissive temperature (21). In addition, recent studies indicate 
that introduction of the cloned T24 bladder oncogene induces TGF 
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production (27). The biological relevance in tumor development is 
also suggested by the presence of activity identified as that of 
TGF-a in the urine of cancer patients, in contrast to the normal 
controls (28-30). The assay used, however, would not distinguish 
tiie activity of other growth factors such as EGF. These and other 
observations suggest that TGF- a may play a role in tumor formation 
via an autocrine mechanism, by which the TGFs are secreted by the 
transformed cells and maintain and stimulate this transformed 
character of the same cell population (31-32). However, the 
potentiating effect of TGF-b may be, needed as jsuflflspljed by .the 
secretion of both TGF'-'a and' -e "by "tumor'ceTlV (14)'!*'Tn this' way, 
TGF-a may be a very potent effector molecule during malignant 
transformation. 

Heterogeneous molecular weights for TGF-a are reported for 
extracts and supernatants from tumor cells (5, 12, 22, 33-34) and in 
urine (28-30). A small species of about 7 kilodaltons has been 
purified from both rodent (27, 35) and human (34, 35) cellsources. 
Amino acid sequence analysis of this rat and mouse TGF-a shows some 
homology with EGF (27, 35). The reported partial polypeptide 
sequence of the human TGF-a shows a strong homology with the rat and 
murine species (35). 

It has been observed that patients with metastasized renal 
cell carcinoma can develop a progressive decalcification of the 
bone, which is reflected in a humoral hypercalcemia (36). A recent 
study using impure protein preparations suggests that transforming 
growth factors (including TGF- a ) may cause bone resorption in a 
tissue culture system (37). 

TGF-a can only be produced in such limited quantity by 
virus transformation of cells as to be Impractical for 
aforementioned use. as a^ therapeutic or diagnostic reagent- 
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Recombinant DHA Technology 

Heterologous proteins, i.e., proteins not normally produced 
by a cell, are synthesized by cells that have been transformed by 

5 exogenous DNA. This typically is accomplished by introducing 

foreign DNA into a cell in the form of a vector. DHA recombination 
of the elements of a vector, i.e., an origin of replication, one or 
more phenotypic selection characteristics, an expression promoter, 
heterologous gene insert and remainder of the vector, generally is 

10 performed outside the host cell. The resulting recombinant 

replicable expression vehicle, or plasmid, is introduced into cells 
by transformation and large quantities of the recombinant vehicle 
obtained by growing the transformant. Where the gene is properly 
Inserted with reference to portions which govern the transcription 

15 and translation of the encoded DNA message, the resulting expression 
vehicle is useful to actually produce the polypeptide sequence for 
which the inserted gene codes, a process referred to as expression. 
The resulting product may be obtained from intracellular locations 
by lysing the host cell, or from culture media in the case of 

20 secreted products and thereafter recovered by appropriate 
purification from contaminant proteins. 

Summary of the Invention 

25 The present invention is based upon the discovery that 

recombinant DNA technology can be used successfully to produce the 
transforming growth factor-a (TGF-a) precursor and its fragments in 
amounts sufficient to initiate and conduct animal and clinical 
testing as prerequisites to market approval. TGF-a precursor and 

30 its fragments, including particularly mature TGF-a, and optionally 
in combination with human transforming growth factor-B, has utility 
for example in the therapeutic treatment of human beings for bono 
diseases and to accelerate wound healing. In addition, TGF-a is 
useful as an adjuvant for cell culture so as to reduce requirements 
"tiMj£Wtf4fr«ftftfr-< n the culture- metfta f t*eWby*"retultf ng in purification 
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advantages), and to stimulate or enhance cell growth in cell 
culture. Also, preparation of large quantities of the TGF-a 
precursor and its fragments enables the preparation of reagents for 
the assay of the TGF-a precursor and Us fragments in body fluids 
5 for the diagnosis of neoplastic or other diseases. 

The complete nucleotide and imputed amino acid sequence for 
the TGF-a precursor is depicted in Fig. 3. For convenience in 
identifying the various principal domains of the precursor we have 

10 designated three polypeptides, mature TGF-a (depicted as the boxed 
sequence in Fig. 3 at residues 40-89), the TGF-a C-terminal polypep- 
tide at residues 90-160 {principally the 62 residue polypeptide from 
Gin 98 to Val 160), designated herein as TGF-aC, and the TGF-a 
H-terminal polypeptide from Met 1 to Ala 22 (TGF-«N). As such 

15 therefore, the TGF-a precursor is a TGF-a bearing polypeptide 

including mature TGF-a and is in effect a fusion of TGF-a with its 
normal flanking sequences. Also, for convenience, the TGF-a pre- 
cursor and Its fragments (including mature TGF-a, TGF-aC and TGF-aN) 
collectively will be referred to herein as PRTGF-a species. The 

20 term TGF-a shall mean mature TGF-a. Also, unless otherwise stated 
the term PRTGF-a species shall be deemed to include PRTGF-a species 
derivatives such as amino acid sequence mutants as are more fully 
described infra . Another PRTGF-a species fragment included herein 
is composed of TGF-aC and mature TGF-a. This polypeptide, desig- 

25 nated TGF- a mC and extending from residues 20-160, is believed to be 
the an early product of precursor TGF- a expression in mammalian 
cells since the TGF-aN sequence is believed to only function as a 
signal which would be cleaved from TGF-omC upon processing by the 
endoplasmic reticulum. These principal domains may not be found in 

30 vivo with precisely the designated amino or carboxyl termini as some 
variation in cellular processing is to be expected. 

_PBTGF-a. .fragments generally (a) exhibit some Molo.gical 
activity in common with PRTGF-a, e.g., antibody cross-reacti vity or 
35 induction of anchorage independence, (b) exhibit substantial homology 
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with some region in PRTGF-a and (c) are at least about 5 amino acid 
residues long, and are ordinarily about from 10 to 130 residues in 
length. 

5 The invention provides methods for assaying such heretofore 

unidentified polypeptide fragments of PRTGF-a as TGF-aN and TGF-aC, 
and provides methods for assaying same without interference from 
other peptides encompassing or containing part of these sequences. 

10 Since the present invention now makes the complete amino 

acid sequence of PRTGF-a species known it is possible for the first 
time to raise antibodies against predetermined amino acid sequences 
of PRTGF-a species. Amino acid sequences representing PRTGF-a 
species fragments {other than TGF-a) are linked in immunogenic 

15 conjugates to proteins and then used to immunize animals. Such 

antibodies are useful in specific immunoassays for PRTGF-a species 
and in passive immunotherapy. 

The present invention further comprises essentially pure 
20 PRTGF-a species 1n which the PRTGF-a species are free of 

contaminants with which the are ordinarily associated in the 
non-recombinant cellular environment. Such contaminants are those 
which are normally present with the TGF-a as found in nature, e.g. 
in cells, cell exudates or body fluids, and include human serum 
25 albumin, gamma globulin, lipoproteins, and growth factors such as 

epidermal growth factor (EGF). Other proteins from the source from 
which the PRTGF-a species DNA is obtained may be present in the 
PRTGF-a species compositions herein, e.g. TGF-e and platelet-derived 
growth factor (PDGF), but here they will be known and present in 
30 predetermined amounts. For example, recombinant cell culture in 
non-human cells enables the production of human TGF-a which are 
absolutely free of other human proteins. 

Tns present invention is also directed to replicable DMA 
35 expression vehicles harboring DNA encoding PRTGF-a species in 
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expressible form, to microorganism strains or cell cultures 
transformed with such DNA and to microbial or cell cultures of such 
transformed strains or cultures, capable of producing PRTGF-a 
species. Still further, this invention is directed to methods for 
5 recombinant fermentative synthesis of PRTGF-a species in said 
microorganisms and cell cultures. 

DMA is provided that encodes PRTGF-a species and which, 
when expressed in recombinant or transformed culture, yields copious 
10 quantities of such PRTGF-a species. This DNA is novel because cDNA 
obtained by reverse transcription of mRNA from PRTGF-a species 
synthesizing cells contains no introns and is free of any flanking 
regions encoding other proteins homologous to the source of the DMA. 

15 Chromosomal DNA encoding PRTGF-a species is obtained by 

probing genomic DNA libraries with cDNA encoding PRTGF-a species. 
Chromosomal DMA is obtained free of its normal chromosomal 
environment and is therefore free of regions flanking the upstream 
5 l end of the first exon or flanking the 3' downstream end of the 

20 last exon which encodes other proteins homologous to the genomic DNA 
source but will contain introns since the genomic coding sequence 
for human precursor TGF-a contained in six distinct exons. Such DNA 
is useful for transforming mammalian cells to synthesize PRTGF-a 
species as is further described herein. 

25 

The isolated PRTGF-a species DNA is readily modified by 
substitution, deletion or insertion of nucleotides, thereby 
resulting in novel DNA sequences encoding PRTGF-a species (in the 
case of nucleotide substitutions that do not change the encoded 
30 amino acid sequence) or its sequence mutants. Modified DNA 

sequences which do not change the amino acid sequence are useful in 
enhancing the efficiency of PRTGF-a species expression in chosen 
host-vector systems, ^e.g- where a human codon -is- mutated to a codon 
preferred by an intended host cell. 

35 
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These novel DMA sequences or fragments thereof are labelled 
and used in hybridization assays for genetic material (DNA or mftNA) 
encoding PRTGF-a species. 

5 In processes for the synthesis of PRTGF-a species, DNA 

which encodes PRTGF-a species is ligated into a replicable 
(reproducible) vector, the vector used to transform host cells, the 
host cells cultured and PRTGF-a species recovered from the culture- 
The PRTGF-a species which are capable of synthesis herein include 

10 the TGF-a precursor, its fragments such as TGF-aC and TGF-a, and 
derivatives thereof including (a) fusion proteins wherein PRTGF-a 
species (including mature TGF-a) are linked to heterologous proteins 
or polypeptides by a peptide bond at the amino and/or carboxyl 
terminal amino acids of the TGF-a, (b) insertional or substitutional 

15 mutants of PRTGF-a species wherein one or more amino acid residues 
are mutated and (c) methionyl or modified methionyl, such as fonm/i 
methionyl or other blocked methionyl amino-terminal addition 
derivatives of the foregoing fusions, fragments or mutants - 

20 Vectors which comprise DNA encoding PRTGF-a species 

operably ligated to a heterologous secretory leader sequence 
(usually a signal from a protein homologous to the intended host 
organism are used to transform host cells- Host cell processing of 
the resulting PRTGF-a species fusion results in secretion of the 

25 PRTGF-a species without ami no-terminal methionyl or blocked 
methionyl . 

Also within the scope of this invention are derivatives of 
PRTGF-a species other than variations in amino acid sequence. Such 
30 derivatives are characterized by covalent or aggregative association 
with chemical moieties. The derivatives generally fall into three 
classes: Salts, side chain and terminal residue covalent 
modifications, and ad so rptf on complexes. - -■- 

35 After a PRTGF-a species has accumulated in direct 
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recombinant prokaryotic culture (other than as a fusion with a 
prokaryotic signal sequence) it is separated from other proteins by 
virtue either of its physical form as water insoluble refractile 
bodies or its considerable stability against denaturation. 
Generally, the insoluble matter in the prokaryotic culture is _ 
recovered, refractile bodies /which contain the PRTGF-a species) 
separated from insoluble cell debris and the refractile bodies 
solubilized. Optionally, glutathione treatment will follow this 
step in order to enhance proper refolding of the protein. PRTGF-a 
species from recombinant eukaryotic cell culture is water soluble 
and does not require resolubilization; it is purified using 
conventional methods heretofore employed in isolating PRTGF-a 
species from natural sources. 

Purified PRTGF-a species from recombinant cell culture are 
combined for therapeutic use with physiologically innocuous stabi- 
lizers and excipients and prepared in dosage form as by lyophiliza- 
tion 1n dosage vials or, preferably storage in aqueous preparations. 
The latter is feasible because PRTGF-a species are quite stable to 
thermal denaturation due to the prevalence of disulfide bonds in the 
molecule.- Alternatively, PRTGF-a species are incorporated into a 
polymer matrix for implantation into or attachment onto surgical 
sites, e.g. in bandages, thereby effecting a timed-release of the 
PRTGF-a species in a localized high gradient concentration. 

PRTGF-a species-containing compositions are administered to 
animals, particularly patients requiring accelerated tissue growth, 
in therapeutically effective doses. Suitable dosages will be 
apparent to the artisan in the therapeutic context. 

Brief Description of the Drawings 

Figure- 1- shows ol igonucleotides used as -hybridization 
probes for the detection of the DNA sequence for human precursor 
TGF-a- 
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Figure 2 shows a nucleotide sequence of the 180 bp Sau3 A- 
fragment of plasnrid pTGF 15-1, containing the exon coding for the 
first 33 amino acids of human TGF-a. The deduced amino acid 
sequence is shown- The sequence in capital letters is part of the 
TGF-o polypeptide, while the smaller letter type shows the amino 
acid sequence preceding TGF-a in the precursor. The arrows indicate 
the acceptor and donor site of intervening sequences as determined 
by comparison with the cDNA sequence. Some relevant restriction 
sites are indicated. 

Figure 3 shows the nucleotide sequence and deduced amino 
acid sequence of the cDNA contained in plasmid pTGF-Cl. The G-C 
tails flank the cDNA at both sides. Numbers above each line refer 
to the amino acid position, assuming that the single methionine 
constitutes the NH 2 - terminus. The amino acid sequence for 
TGF-a i E boxed and bounded at both sides with an 
Ala-Val rich sequence (overlined residues) . Some 
relevant restriction sites are ' indicated. 



Figure 4 shows a schematic representation of the 
construction pathway for plasmids pTEl, 2, 3, 4, 5, 6, 7, 8. 

Figure 5 is a schematic representation of the TGF-o 
expression plasmids pTE5 and pTE6 indicating the amino acid junction 
of the TGF-a fusion proteins. 

Figure 6 shows the results upon electrophoresis in a SDS-13 
percent polyacryl amide gel (79) of total lysates of E. coli 
containing the expression plasmids pTE2, pTE3, pTE5, or pTE6. The 
arrows indicate the TGF-a. fusion proteins. The values at the right 
show the positions of the protein markers. 

Figure 7 shows an SDS-polyacryl amide gel of the bacterial . . 
short TGF-a fusion protein, enriched by the acid-ethanol method. 
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The 68 residue TGF- a fusion protein migrates as a broad band (arrow) 
in this gel.- The values at the right show the positions of the 
reference protein markers. 

5 Figure 8A shows the competition of the bacterial TGF- n 

short fusion protein with 12 ^I -1 abelled EGF in a radioreceptor 
assay (solid line], performed and graphically represented as 
described [5, 14, 27). The calibration curve with EGF is shown as a 
dashed line. Figure 8B shows the EGF calibration curve. The 
10 ordinate shows the binding of 125 I-labelled EGF to the cells. 

Figure 9 shows soft-agar colony-forming activity of murine 
EGF, the bacterial TGF-a fusion protein, before and after cleavage 
with cyanogen bromide. The assay was performed in the presence of 

1 ^ human TGF-p using NRK cells, clone 49F, as described (14). The 
ordinate scores the number of colonies larger than 850 nm 2 , while 
the abscissa indicates the concentration of EGF or bacterial TGF-a, 
expressed in EGF equivalents (ng/ml), as determined in the EGF, 
while the solid lines give the results for the bacterial TGF-a 

20 fusion protein before (open circle) or after (closed circle) 
treatments with cyanogen bromide. 

Figure 10 depicts plasmid pyTE2 for yeast expression of 
TGF-a. This plasmid has the TGF-a sequence (dashed) with the 

25 preceding a-factor (MF- a ) prepro sequence under the transcriptional 
control of the a-factor promoter. The TGF-a sequence is followed by 
the "Able" gene 3* untranslated region and polyadenyl ation signal 
(open box). The TRP1 functions as the selection marker in yeast. 
The replication in yeast is assured by the presence of the 2[i 

30 replication origin. AP R : ampicillin resistance. The junction 

between the a-factor prepro sequence and the TGF-a sequence is shown 
at the right of the plasmid map. 
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Detailed Description 

PRTGF-a species are defined far the purposes herein as 
polypeptides other than EGF which have a substantial region showing 

5 functional amino acid homology with the TGF-a precursor amino acid 
sequence set forth in Fig- 3,^ or a fragment thereof. A candidate 
polypeptide is substantially homologous with PRTGF-a species as 
defined herein if greater than about 35 percent of the residues in 
the candidate correspond to a sequence within the PRTGF-a species 

10 sequence, without making conservative amino acid substitutions or 
introducing gaps. Ordinarily a candidate polypeptide, in addition 
to such functional homology, will be capable of exhibiting 
biological activity in common with its homologous PRTGF-a species. 

15 The degree of amino acid sequence homology which brings a 

polypeptide within the scope of the definition of PRTGF-a species 
will vary depending upon whether the homology between the candidate 
protein and PRTGF-a species falls within or without the PRTGF-a 
species regions responsible for biological activity; domains which 

20 are critical (a) for inducing morphological changes in target cells, 
Cb) for immunological cross-reactivity with antisera raised against 
PRTGF-a species as may occur in non-recombinant sources, or (c) for 
cell surface receptor binding should exhibit a high degree of 
homology in order to fall within the definition, while sequences not 

25 involved in maintaining "these functions show comparatively low 

homology. In addition, critical domains may exhibit one or more of 
these functions and yet remain homologous as defined herein if 
residues containing functionally similar amino acid side chains are 
substituted. Functionally similar refers to dominant characteristics 

30 of the side chains such as basic, neutral, hydrophobic or acid, or 
the presence or absence of steric bulk. 

Generally a polypeptide defined as PRTGF-a species will 
contain regions substantially homologous with the Fig. 3 protein or 
35 fragments thereof over a continuous domain of at least about from 10 
to 25 amino acid residues. 
2846Y ; 
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A significant factor in establishing the identity of a- 
polypeptide as a PRTGF-a species is the ability of antisera which 
are capable of substantially binding to the nonrecombinant 
counterpart to also bind to the activity of the polypeptide in 

5 question. However it will be recognized that immunological identity 
and identity as to other biological activity is not necessarily 
coextensive. For example, a neutralizing antibody for the receptor 
binding activity of the mature TGF-a of Fig. 3 may not bind a 
candidate protein because the neutralizing antibody happens to not 

10 be directed to specifically bind a site on mature TGF-a that is 
critical to its activity. Instead, the antibody may bind an 
innocuous region and exert its neutralizing effect by steric 
hinderance. Therefore a candidate protein mutated in this innocuous 
region might no longer bind the neutralizing antibody, but it would 

15 nonetheless fall within the definition of TGF-a in terms of its 
substantial homology with TGF-a. 

The language "capable" of biological activity means 
polypeptides which can be converted, as by enzymatic hydrolysis, 

20 from an inactive state anologous to a zymogen to a polypeptide 

fragment which exhibits the desired biological activity. Typically, 
inactive precursors will be fusion proteins in which PRTGF-a species 
is linked by a peptide bond at either terminus to a heterologous 
protein or fragment thereof. The sequence at this peptide bond is 

25 selected so as to be susceptible upon proteolytic hydrolysis to 
release PRTGF-a species, either in vivo or as part of a 
manufacturing protocol, in vitro- 

While PRTGF-a species ordinarily means human PRTGF-a 
3 q species, PRTGF-a species from sources such as murine, porcine, 
equine or bovine are included within the definition of PRTGF-a 
species so long as they otherwise meet the standards described above 
for homologous regions. Mature TGF-a in all cases, however, is 
"human" TGF-a. 

35 
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Derivatives of PRTGF-a species factor are included within 
the scope of the term. Derivatives include amino acid sequence 
mutants, glycosylated variants and covalent or aggregative 
conjugates with other chemical moieties. Covalent derivatives are 
5 prepared by linkage of functionalities to groups which are found in 
the PRTGF-a species amino acid side chains or at the N- or 
C-termini, by means known in the art. These derivatives may, for 
example, include: Aliphatic esters or amides of the carboxyl 
terminus or residues containing carboxyl side chains, e.g., asp3Z or 

10 49; 0-acyl derivatives of hydroxy! group-containing residues such as 
ser31", ser3, ser42, ser!56 or ser94; and N-acyl derivatives of the 
amino terminal amino acid or amino-group containing residues, e.g. 
lysine or arginine. The acyl group is selected from the group of 
alky! -moieties {including C3 to CIO normal alkyl), thereby forming 

15 alkanoyl species, and carbocyclic or heterocyclic compounds which 
forming aroyl species. The reactive groups preferably are 
difunctional compounds known per se for use in cross-linking 
proteins to insoluble matrices through reactive side groups. 

20 Covalent or aggregative derivatives are useful as reagents 

in immunoassay or for affinity purification procedures. For 
example, PRTGF-a species are insolubilized by covalent bonding to 
cyanogen bromide-activated Sepharose by methods known £er se or 
adsorbed to polyolefin surfaces {with or without glutaraldehyde 

25 cross-linking) for use in the assay or purification of anti -PRTGF-a 
species antibodies or cell surface receptors. PRTGF-a species also 
are labelled with a detectable group, e.g., radioiodinated by the 
chloramine T procedure, covalently bound to rare earth chelates or 
conjugated to another fluorescent moiety for use in diagnostic 

30 assays, especially for diagnosis of PRTGF-a species levels in 
biological samples by competitive-type immunoassays. 



TGF-clN and TGF-aC are useful as imtminogens (alone or as an 
immunogenic conjugate with a heterologous protein) for raising 
35 antibodies against the portions of the TGF-a precursor. In a 
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two-site sandwich specific receptor binding assay, the purpose of 
which is to distinguish the TGF-a precursor from mature TGF-a, 
TGF-aN and TGF-aC, anti-TGF-aN antibody is immobilized prior to or 
during the course of the assay procedure, a test sample is incubated 

5 with the antibody in order to permit the precursor to bind thereto, 
and then ant i-TGF-aC is incubated with the bound precursor. The 
anti-TGF-aC is labelled before incubation, for example by 
radioodi nation, or afterwards, for example by incubation with 
labelled IgG directed against the IgG of the species in which the 

10 anti-TGF-aC was raised. 

Antisera are raised against the predetermined PRTGF-a 
species fragments by crossl inking them to immunogenic proteins such 
as keyhole limpet hemocyanin (KLH) or serum albumin by the use of 

15 covalent agents such as glutaraldehyde or succinate anhydride 

immunizing suitable animals such as mice or rabbits by subcutaneous 
injection with conventional adjuvants, boosting as necessary and 
recovering antisera- Monoclonal antibodies are prepared from spleen 
cells of immunized mice in conventional fashion, e.g. 

20 immortalization with EB virus or by cell fusion. 

in a method for the determination of TGF-aC or TGF-aN 
without interference from mature TGF-a, antisera are raised against 
predetermined fragments at opposite ends of the TGF-aC or TGF-aN 

25 molecules and the resulting two antisera are employed in a sandwich 
assay as described above for the assay of the TGF-a precursor. In 
the case of TGF-aC, for example, the sequence (Cys) a Arg His Glu 
Lys Pro Ser Ala Leu Leu Lys Gly Arg Thr Ala (Cys)^, wherein a or 
b, but not both, is 1, is conjugated to KLH by disulfide bonds, and 

30 rabbits immunized with the conjugate. The antisera are harvested 
and stored. Similarly, rabbits are immunized against the TGF-aC 
sequence His Cys Glu Trp Cys Arg Ala Leu lie Cys Arg linked to KLH 

by the .use ..of. succinic anny.dridfi.iLt pH6. . These, two antisera. are .-. , 

useful in competitive or sandwich immunoassays. Alternatively, 

35 rabbits may be immunized against the entire TGF-aN or TGF-aC 
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polypeptides and two antisera for assay of each polypeptide selected 
based on their ability to not competitively inhibit one another for 
binding to TGF-afJ or TGF-aC as the case may be. Competitively 
inhibiting antisera are still useful in competitive-type assays for 

5 proteins encompassing the fragment against which antisera were 
raised, but one will be unable to readily distinguish such other 
proteins from the fragment. TGF-omC most conveniently is assayed in 
body fluid samples such as urine by a sequential or simultaneous 
sandwich immunoassay in which one of the TGF-aC antibodies is used 

10 in concert with an anti -mature TGF-a antibody. 

It will be understood that natural allelic variations in 
PRTGF-a species exist and occur from individual to individual. 
These variations may be demonstrated by one or more amino acid 
15 deletions, substitutions or insertions. Other mutants are 

predetermined variations in the PRTGF-a species sequence made by 
site directed mutagenesis of the PRTGF-a species DNA. 

The objective of site directed mutagenesis is to construct 
20 DNA that encodes PRTGF-a species as defined, but which species also 
exhibit improved properties and activity. Mutant PRTGF-a species 
are defined as a polypeptide otherwise falling within the homology 
definition for PRTGF-a species set forth herein but which have an 
amino acid sequence different from that of PRTGF-a species whether 
25 by way of deletion, substitution or insertion. For example, the 
lysine residue at position 96 or 97 may be mutated to histidine or 
another amino acid residue which no longer permits the protein to be 
proteolytically cleaved at this site. Similarly, cysteine 47, 55, 
60, 71, 73 and/or 82 could be replaced by serine, or the carboxyl 
30 and/or amino terminus of PRTGF-a deleted. It Is not necessary that 
mutants have all of the biological characteristics of PRTGF-a 
species where the mutants retain at least one epitopic site which is 
cross-reactive with antibody to PRTGF-a species. 
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While the mutation site is predetermined in the PRTGF-a 
species mutations of this invention it is not necessary that the 
mutation per se be predetermined. For example, in order to optimize 
the performance of the position 47, 55, 60, 71, 73 or 82 mutants 

5 random mutagenesis may be conducted at the cysteine codons and the 
expressed PRTGF-a species mutants screened for the optimal 
combination of biological activity and compatibility with 
intracellular conditions in prokaryotes, i.e., solubility upon 
direct expression. Techniques for making substitution mutations at 

10 predetermined sites in DNA having a known sequence are well known, 
for example M13 primer mutagenesis, but here the small size of 
PRTGF-a species facilitates chemical synthesis of the desired DNA 
having predetermined mutations. 

15 Mutagenesis is conducted by making amino acid insertions, 

usually on the order of about from 1 to 10 amino acid residues, or 
deletions of about from 1 to 30 residues. Substitutions, deletions, 
insertions or any subcombination may be combined to arrive at a 
final construct. Insertions include amino or carboxyl -terminal 

20 fusions, e.g. a hydrophobic extension added to the C- or M- terminus 
of mature TGF-a. Obviously, the mutations in the encoding DNA must 
not place the sequence out of reading frame and preferably will not 
create complementary regions that could produce secondary mRNA 
structure. 

25 

Not all mutations in the DNA which encode PRTGF-a species 
will be part of the final product. For example, a major class of 
DNA insertional mutations are those in which a heterologous 
secretory leader, or signal, has been appended to the N-terminus of 

30 the PRTGF-a species. Alternatively, N-terminal PRTGF-a species 
fusions with non-secretory heterologous polypeptides are 
contemplated by this invention where such polypeptides can be 
.cleaved, fat-example by cyanogen- bromide or enzymes, -from the .... - 
PRTGF-a species in order to yield unmethionylated PRTGF-a species. 

35 For example, in constructing a procaryotic expression vector the 
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E- coli alkaline phosphatase or heat stable enterotoxin II leaders 
are placed 5' from the PRTGF-a species sequence in reading frame 
therewith. Yeast invertase, alpha factor or acid phosphatase 
leaders are similarly used in yeast expression of unmethionylated 
5 PRTGF-a species. However, the native TGF-a precursor secretory 
leader may be recognized by hosts other than cells of its origin, 
most likely in cell culture of higher eukaryotic cells. When the 
secretory leader is "recognized" by the host, the fusion protein 
consisting of PRTGF-a species and the leader ordinarily is cleaved 
10 at the peptide bond joining PRTGF-a species and the signal in the 
events that lead to secretion of the PRTGF-a species. Thus, even 
though a mutant PRTGF-a species DNA is used to transform the host, 
and mutant prePRTGF-a species (a fusion) is synthesized as an 
intermediate, the resulting PRTGF-a species is not a fusion. 

15 

As used herein, transforming growth factor-B (TGF-b) 
denotes transforming growth factor of the B-type with the phenotype 
of naturally occurring TGF-b, e.g. capable of potentiating TGF-a or 
EGF fepidermal growth factor) for anchorage independent growth. 

20 

DNA encoding PRTGF-a species is covalently labelled with a 
detectable substance such as a fluorescent group, a radioactive atom 
or a chemi luminescent group by methods known p_er se. It is then 
used in conventional hybridization assays. Such assays are employed 
25 in identifying PRTGF-a species vectors and transformants as 

described in the Examples infra, or for in vitro diagnosis such as 
detecton of PRTGF-a species mRNA in tumor cells. 

The mRNA for PRTGF-a species, surprisingly, fs relatively 
30 rare. This makes the cDMA easy to overlook were one not apprised as 
to what to look for. However, once its presence is appreciated and 
completely complementary DNA made available, as is enabled by the 
disclosures herein, it is routine to screen cDNA libraries of tumor 
cells for PRTGF-a species cDNA using probes having complementary 
35 sequences. 
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PRTGF-a species are synthesized by host cells, transformed 
with expression vectors containing DNA which encodes PRTGF-a 
species. Expression vectors include vectors which together with a 
host cell are capable of expressing DNA sequences contained therein, 

5 where such sequences are operably linked to other sequences capable 
of effecting their expression. These vectors must be replicable in 
the host organisms either as epi somes or as an integral part of the 
chromosomal DNA. In general, expression vectors will be plasmids, 
circular doubled stranded DNA loops which, in their vector form, are 

10 not bound to the chromosome. In the present specification, 
"plasmid" and "vector" generally are used interchangeably as 
plasmids are the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression 
vectors, e.g. cotransformation vectors and viruses which serve 

15 equivalent functions. 

DNA regions are operably linked when they are functionally 
related to each other. For example, DNA for a presequence or 
secretory leader is operably linked to DNA for a polypeptide if it 

20 is expressed as a preprotein which participates in the secretion of 
the polypeptide; a promoter 1s operably linked to a coding sequence 
if it controls the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is 
positioned so as to permit translation. Generally, operably linked 

25 means contiguous and, in the case of secretory leaders, contiguous 
and in reading phase. 

Recombinant host cells are cells which have been 
transformed with the above-described vectors. As defined herein, 
30 PRTGF-a species are produced in the amounts achieved by virtue of 
this transformation, rather than in such lesser amounts, or, more 
commonly, in such less than detectable amounts, as might be produced 
by the untransformed host. PRTGF-a species produced by such cells 
are referred to as recombinant PRTGF-a species. 
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Host Cell Cultures and Vectors 



The vectors and methods disclosed herein are suitable for 
use in host cells over a wide range of prokaryotic and eukaryotic 
organisms. 



In general, of course, prokaryotes are preferred for 
cloning of DNA sequences in constructing the vectors useful in the 
invention. For example, E- coli K12 strain 294 (ATCC No. 31446} is 
10 particularly useful. Other microbial strains which may be used 

include E. coli strains such as E. coli B, and E. coli X1776 (ATCC 
No- 31537). These examples are, of course, intended to be 
illustrative rather than limiting. 



15 Prokaryotes may also be used for expression, although our 

experience has shown that PRTGF-a species sequences generally are 
deposited in the prokaryotic cell cytoplasm as insoluble refractile 
bodies. These are readily recovered and resolubilized. The 
aforementioned strains, as well as E. coli W3110 (F-x-, 

20 prototrophic, ATCC 27325), bacilli such as Bacillus subtil us, and 
other enterobacteriaceae such as Salmonella typhimurium or Serratia 
marcescens , and various Pseudomonas species may be used. 

In general, plasraid vectors containing replicon and control 
25 sequences which are derived from species compatible with the host 

cell are used in connection with these hosts. The vector ordinarily 
carries a replication site, as well as marking sequences which are 
capable of providing phenotypic selection in transformed cells. For 
example, E. coli is typically transformed using pBR322, a plasmid 
30 derived from an E. coli species f55). pBR322 contains genes for 

ampicillin and tetracycline resistance and thus provides easy means 
for identifying transformed cells. The pBR322 plasmid, or other 
microbial plasmid- must also contain, or be modified to contain, 
promoters which can be used by the microbial organism for expression 
35 of its own protein- Those promoters most commonly used in 
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recombinant DNA construction include the e-lactamase {penicillinase) 
and lactose promoter systems (53, 72, 92} and a tryptophan (trp) 
promoter system {67, 93). While these are the most commonly used, 
other microbial promoters have been discovered and utilized, and 
details concerning their nucleotide sequences have been published, 
enabling a skilled worker to ligate them functionally with plasmid 
vectors (80). 

In addition to prokaryotes, eukaryotic microbes, such as 
yeast cultures, may also be used. Yeast express and secrete mature 
TGF-aat lower levels than bacteria, but the polypeptide is soluble 
unlike the direct expression product from E. coll . Saccharomyces 
cerevisiae , or common baker's yeast is the most commonly used among 
eukaryotic microorganisms, although a number of other strains are 
commonly available. For expression in Saccharomyces , the plasmid 
YRp7, for example, {81, 82, 83) is commonly used. This plasmid 
already contains the trpj gene which provides a selection marker for 
a mutant strain of yeast lacking the ability to grow in tryptophan, 
for example ATCC No. 44076 or PEP4-1 (84). The presence of the trpj 
lesion as a characteristic of the yeast host cell genome then 
provides an effective environment for detecting transformation by 
growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include the 
promoters for 3-phosphoglycerate kinase (85) or other glycolytic 
enzymes (86, 87), such as enolase, glyceraldehyde-3-phosphoglycerate 
mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase, glucokinase and a-factor. In constructing suitable 
expression plasmids, the termination sequences associated with these 
genes are also ligated into the expression vector 3' of the sequence 
desired to be expressed to provide termination of the mRNA and 
polyadenylation. Other promoters, which have the additional 
advantage-of transcription controlled by growth conditions are the 
promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen 
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metabolism, and the aforementioned glyceraldehyde-3-phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose 
utilization (87). Any plasmid vector containing yeast-compatible 
promoter, origin of replication and termination sequences is 
B suitable. 

In addition to microorganisms, cultures of cells derived 
from multicellular organisms may also be used as hosts. In 
principle, any such cell culture is workable, whether from 

10 vertebrate or invertebrate culture. However, interest has been 

greatest in vertebrate cells, and propagation of vertebrate cells in 
culture (tissue culture) has become a routine procedure in recent 
years (75). Examples of such useful host cell lines are VERO and 
HeLa cells, Chinese hamster ovary (CHO) cell lines, and KI38, BHK, 

15 COS- 7 and MDCK cell lines. Expression vectors for such cells 
ordinarily include (if necessary) an origin of replication, a 
promoter located in front of the PRTGF-a species sequence to be 
expressed, along with any necessary ribosome binding sites, RNA 
splice sites, polyadenylation site, and transcriptional terminator 

20 sequences. 

For use in mammalian cells, the control functions on the 
expression vectors are often provided by viral material. For 
example, commonly used promoters are derived from Polyoma, 

25 Adenovirus 2, and most frequently Simian Yirus 40 (SV40). The early 
and late promoters of SV40 virus are particularly useful because 
both are obtained easily from the virus as a fragment which also 
contains the SV40 viral origin of replication (88) incorporated 
herein by reference. Smaller or larger SV40 fragments stay also be 

30 used, provided there is included the approximately 250 bp sequence 

extending from the Hind III site toward the BgJI site located in the 
viral origin of replication. Further, it is also possible, and 
often desirable, to utilize promoter or control sequences normally 
associated with the desired gene sequence, provided such control 

35 sequences are compatible with the host cell systems. 
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An origin of replication may be provided either by 
construction of the vector to include an exogenous origin, such as 
may be derived from SV40 or other viral (e.g. Polyoma* Adeno, YSV, 
BPV, etc.) source, or may be provided by the host cell chromosomal 
5 replication mechanism. If the vector is integrated into the host 
cell chromosome, the latter is often sufficient. 

Rather than using vectors which contain viral origins of 
replication, one can tranform mammalian cells by the method of 

10 cotransformation with a selectable marker and the PRTGF-a species 
DNA. 'An example of a suitable selectable marker is dihydrofolate 
reductase (DHFR). In selecting a preferred host mammalian cell for 
transfection by vectors which comprise DNA sequences encoding both a 
PRTGF-a species and DHFR, it is appropriate to select the host 

15 according to the type of DHFR protein employed. If wild type DHFR 
protein is employed, it is preferable to select a host cell which is 
deficient in DHFR thus permitting the use of the DHFR coding sequence 
as a marker for successful transfection in selective medium which 
lacks hypoxanthine, glycine, and thymidine. An appropriate host 

20 cell in this case is the Chinese hamster ovary (CHO) cell line 

deficient in DHFR activity, prepared and propagated as described by 
Urlaub and Chasin, 1980, "Proc. Natl. Acad. Sci."(USA) 77: 4216. 
{^transformation is further described in U.S. patent 4,399,216; the 
procedures therein are adapted for use in PRTGF-a species synthesis 

25 by substitution of DNA encoding a PRTGF-a species sequence for the 
genomic or e-globin DNA used in the cited patent using appropriate 
synthetic linkers as required. 

On the other hand, if DNA encoding DHFR protein with low 
30 binding affinity for methotrexate (MTX) is used as the controlling 
sequence, it is not necessary to use DHFR resistant cells. Because 
the mutant DHFR is resistant to MTX, MTX containing media can be 
used as a means of selection provided that the host cells are 
"themselves MTX sensitive. Most eukaryotic cells which are capable 
35 of adsorbing MTX appear to be methotrexate sensitive. One such 
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useful cell line is a CHO line, CHQ-K1 {ATCC No. CCL 61). 

The method by which PRTGF-a species are recovered from cell 
culture will depend upon whether. or not they are expressed as a 
5 soluble protein or a refractile body tinsoluble aggregate). The 
latter will usually be the case with bacterial expression, where 
purification is readily accomplished because most cell proteins are 
soluble. 

10 Soluble PRTGF-a species are more readily recoverable from 

cell culture if expressed in a secretory host vector system, e.g. if 
linked to a bacterial or yeast secretory leader, or if a vertebrate 
cell line is transformed with the entire TGF-« precursor sequence 
including its normal signal sequence. 

15 

Soluble PRTGF-a species may be purified using, 
alkyl-sepharose chromatography, gel sieving, gel electrophoresis, or 
receptor-binging affinity chromatography using immoblized, 
antibodies, or receptors. 

20 

Compositions containing PRTGF-a species are prepared for 
administration to patients by mixing PRTGF-a species having the 
desired degree of purity with physiologically acceptable carriers, 
i.e., carriers which are nontoxic to recipients at the dosages and 
25 cor :entrations employed. Ordinarily, this will entail combining the 
PRTGF-a species with buffers, antioxidants such as ascorbic acid, 
low molecular weight {less than about 10 residues) polypeptides, 
proteins, amino acids, carbohydrates including glucose or dextrins, 
chelating agents such as EDTA, and other stabilizers and excipients. 

30 

Also administered to animals are compositions containing 
PRTGF-a species immunogenic conjugates or PRTGF-a species conjugates 
and antibodies capable of binding to PRTGF-a species, the former "to 
raise antibodies particularly for use in diagnostic kits, the latter 
35 to generate anti-antibody antisera. 
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The route of administration of such compositions is in 
accord with known methods, e.g. intravenous, intraperitoneal, 
intramuscular or intralesional infusion or injection of sterile 
therapeutic solutions, or by timed release systems as noted below. 

PRTGF-a species compositions may be administered from an 
implantable timed-release article. Examples of suitable systems 
include copolymers of L-glutamic acid and gamma ethyl -L-glutamate 
CU. Sidman et aK, 1983, "biopolymers" 22 (1): 547-556), poly 
(2-hydroxethyl-methacrylate) (R. Langer et al_- , 1981, "J. Biomed. 
Hater. Res." 15: 167-277 and R. Langer, 1982, "Chem. Tech." 12.: 
98-105) or ethylene vinyl acetate (R. Langer et al_., Id.). The 
article is implanted at surgical sites or over wounds. 
Alternatively, the compositions may be encapsulated in semipermeable 
microcapsules or liposomes for injection. 

The amount of said compositions that is administered will 
depend, for example, upon the route of administration, the target 
disease and the condition of the recipient. Intralesional 
injections will require less composition on a body weight basis than 
will intravenous infusion. Accordingly, it will be necessary for 
tne therapist to titer the dosage and modify the route of 
administration as required to obtain optimal activity as can be 
determined for example by biopsy of the target tissue or diagnostic 
assays. 

TGF-a is a potent bone resorption agent. Accordingly, it 
is therapeutically useful for this purpose. It is adminstered by 
injection, infusion or timed release and the dose is titered by 
following the plasma calcium ion levels; to induce bone resorption 
the dose is titered to generate hypercalcemia. In some patients 
just the converse is the proper therapeutic approach. These 
patients are those who suffer from carcinoma and, as the case with 
some tumors, hypercalcemia attendant the synthesis of PRTGF-a 
species by the tumor cells. These patients are identified by the 
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presence of above-normal concentrations of TGF-omC or TGF-a in 
urine, serum or surgically excised tumor tissue if available. The 
therapeutic regimen involves administering to such patients a TGF-a 
neutralizing agent such as anti-TGF-a {or its antigen-specific Fab 

5 region) or a TGF-a receptor such as the EGF receptor tor its 

TGF-a -bin ding amino terminal extracellular domain}. Methods are 
described above for producing antisera to selected TGF-a domains. 
The antibodies then are screened for their ability upon injection, 
infusion or timed release to correct the hypercalcemia of such 

10 patients- Alternatively, the antibody is generated in situ by 

immunizing the patient against TGF-a or a selected domain thereof as 
described above. 

The nucleotide and amino acid sequence for the EGF receptor 

15 are known (A. Ullrich et al_., May 1984, "Mature" 309: 418-425). In 
addition EGF receptor is obtainable from A431 epidermal carcinoma 
cells, a publicly available cell line having about 10-50 times more 
EGF receptor on their surface than most other cell types. A431 
cells also secrete a truncated receptor having on the extracellular 

20 EGF binding domain (A. Ullrich et al_., Id.). Either EGF receptor 
species is purified according to known methods (M. Waterfield et 
a1 M 1982 "J. Cell Biochem." ZIP : 149-161) or equivalent techniques 
known to those skilled in the art such as affinity chromatography on 
EGF linked to cyanogen bromide activated sepharose. They are 

25 formulated in phanmaceutically acceptable carriers such as sterile 
saline in concentrations therapeutically effective upon infusion to 
bind free TG r -o and thereby lower the serum calcium level. 
Therapeutic efficacy is monitored by the reduction in hypercalcemia 
or by assay of free versus bound TGF-a in fashion analogous to 

30 immunoassays presently employed for the assay of free thyroxine in 

plasma. The use of the EGF receptor or the TGF-a receptor for their 
7GF-a-binair; regions) is preferred over antibodies because of ease 
of manufacture. However* antibodies have the advantage that they 
can be selected having affinities greater than those of naturally 

35 occuring receptors. 
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Methods Employed 

If cells without formidable cell wall barriers are used as 
host cells, transfection is carried out by the calcium phosphate 
precipitation method as described 1n (89). However, other methods 
for introducing DMA Into cells such as by nuclear injection or by 
protoplast fusion may also be used. 

If prokaryotic cells or cells which contain substantial 
cell wall constructions are used, the preferred method of 
transfection is calcium treatment using calcium chloride as 
described by (90). 

Construction of suitable vectors containing the desired 
coding and control sequences employ standard ligation techniques. 
Isolated plasmids or DNA fragments are cleaved, tailored, and 
religated in the form desired to form the plasmids required. 

Cleavage is performed by treating with restriction enzyme 
(or enzymes) in suitable buffer. In general, about 1 ng plasmid or 
DNA fragments are used with about 1 unit of enzyme in about 20 pi of 
buffer solution. (Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the manufacturer.) 
Incubation times of about 1 hour at 37°C are workable. After 
incubations, protein is removed by extraction with phenol and 
chloroform, and the nucleic acid is recovered from the aqueous 
fraction by precipitation with ethanol . 

If blunt ends are required; the preparation can be treated 
for 15 minutes at 15°C with 10 units of Polymerase I (Klenow), 
phenol chloroform extracted, and ethanol precipitated. 

Size separation of the cleaved fragments is often performed 
using 6 percent polyacryl amide gel, e.g. as described by Goeddel, 
et al_. (67). 
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For ligation, the desired components, suitably end tailored 
to provide correct matching, are treated with about 10 units T4 DNA 
ligase per 0.5 pg DNA. (When cleaved vectors are used as 
components, it may be useful to prevent religation of the cleaved 
vector by pretreatment with bacterial alkaline phosphatase.) 

For analysis to confirm correct sequences in plasmids 
constructed, the ligation mixtures are used to transform E_. coli K12 
strain 294 (ATCC 31446), and successful transformants selected fay 
ampicillin or tetracyclin resistance where appropriate. Plasmids 
from the transformants are prepared, analyzed by restriction enzymes 
and/or sequenced {45, 91). 

General Description of Preferred Embodiments 

Examples 

The following examples are intended to illustrate but not 
to limit the invention. In the examples here an £. coli host 
culture was employed as host cell culture. However, other 
eukaryotic and prokaryotic cells are suitable for the method of the 
invention as well . 

1. Isolation of a TGF-g specific genomic DMA clone . 

Isolation of a TGF-a gene is based upon specific 
hybridizatin with synthetic oligonucleotides. These probes were 
designed on the basis of a preliminary partial amino acid sequence 
for human TGF- a and are shown in Fig. 1. 

It has recently been shown that specific DNA sequences can 
be isolated from cDNA or genomic DNA libraries by hybridization 
under low stringency conditions with long synthetic oligonucleotides 
which contain many mismatches. This approach was successful in the 
detection of the gene for human insulin-like growth factor I as set 
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forth in U.S. Appln. Serial No. 501,351 filed June 6, 1983 (EPO 
Application No. 84.3037847), incorporated herein by reference. A 
similar approach was attempted for the isolation of the DNA sequence 
coding for human TGF-a. Two long oligonucleotides were synthesized 

5 for this purpose (Figure I). A 41-mer corresponds to a sequence, 
coding for amino acids 12 to 25, while the 48-mer is complementary 
to a sequence coding for amino acids 1 to 16. Since several 
different codons are possible for each amino acid, nucleotide choice 
of the oligonucleotides was based on the codon bias observed in 

10 human mRNAs (94). Also, the presence of multiple CpG dinucleotides 
was avoided. Since the 3' ends of the 41-mer and the 48-mer are 
complementary over a length of 15 residues, it is possible to 
hybridize both oligonucleotides and extend them with avian 
myeloblastosis virus (AMY) reverse transcriptase, thereby creating a 

15 double- stranded 74-raer. In addition to these long oligonucleotides, 
two sets of 14-mers were synthesized. Four pools of 14-mers, 
■ designated 1A, B, C and D are complementary to all possible codons 
for amino acids 5 to 9. Similarly, four other pools of 14-mers, 
named 2A through D correspond to amino acids 15 to 19. 

20 

These oligonucleotides shown in Figure 1 were used as 
hybridization probes for the detection of the DNA sequence for human 
TGF-a. The oligonucleotides were designed with reference to a 
partial amino acid sequence for human TGF-a (35) and were 
25 synthesized using the solid phase phosphotri ester method (38, 39). 

The 41-mer, 48-mer and the 14-mers were 5' -labeled with 
Y _32p_ ATp and p 0 -)y nuc ieotide kinase in a reaction mixture 
containing 70 mH Tris-HCl {pH 7.6),, 10 mM MgCl 2 , 5 mM 

30 dithiothreitol at 37°C for 30 min. The 74-mer was prepared by 

heating equimolar amounts of the 41-mer and the 48-mer for 5 min. at 
70°C in a 25 nl mixture of 105 mM Tris-HCl, pH 8.3, 140 mM KC1 , 50 
mM MgCl 2 a nd 210 mM e-mercaptoethanol and gradually cooling over a 
30 min interval to room temperature. These annealed oligonucleotides 

35 were subsequently extended to double-stranded radioacti vely-1 abeled 
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74-mers by adding dTTP and dGTP to 10 mM, 150 nCi each of 
a- 32 p-dATP and a - 32 P-dCTP and 80 units of AMV reverse 
transcriptase to a total volume of 70 yl. The incubation was for 30 
min. at 37°C. Unlabeled dATP and dCTP were then added to 10 raH and 
the reaction was allowed to proceed for 15 rain. The unincorporated 
nucleotide triphosphates were separated from, the hybridization 
probes by chromatography over a Sephadex G50 Superfine. 

Initially focus was on the isolation of TGF-a specific 
clones from cDNA libraries derived from mRNA from the human melanoma 
cell line A2058 (5, 34), which was used as a source for the purifi- 
cation of TGF-a. Extensive screenings with the 14-mers, the 41-mer 
and the 74-mer permitted the isolation of some hybridizing cDNA 
clones, which upon sequence analysis were found to be unrelated to 
TGF-a. Because of this lack of success, it was decided to search 
for the TGF-a gene in a human genomic DNA library contained in a x 
Charon 4A phage (40). About 7.5xl0 5 phages were screened by 
hybridization with the 41-mer, which was 5'-labeled with 32 P. In 
another experiment, an equal number of recombinant phages were 
screened by replica plating of the recombinant x phages onto nitro- 
cellulose filters (95) and hybridization with the radioactively- 
labeled 74-mer. These screenings resulted in the detection and 
isolation of 35 individual recombinant x phages which hybridized 
with the 41-mer and/or the 74-mer. DNA was isolated from all 35 
phages. Hybridization with the 41-mer, 74-mer, 48-mer and the 
14-mer pools 1A-D and 2A-D was assessed by "dot blot" analysis (43) 
for each recombinant x DNA. None of these isolated phage DMAs 
hybridized clearly with the 48-mer or the pools 1A-D, while about 
half of the DNAs showed some hybridization with a mixture of the 
14-mer pools 2A-D. 

The 41-mers, 48-mers and 74-mers were hybridized in 5X SSC 
(IX SSC = 0.15 M NaCl, 0.015 M sodium citrate), 5X Denhardt solution 
(IX Denhardt solution = 0.1 percent Ficoll, 0.1 percent 
polyvinpyrollidone, 0.1 percent bovine serum albumin, 20 percent 
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formamide and 50 ug/tnl sonicated salmon sperm DNA. 

After the filters were prehybridized for 2 hrs at 42°C, the 
heat denatured probe was added and hybridization took place at 42°C 

5 for 15-20 hrs. The filters were subsequently washed extensively in 
IX SSC, 0.1 percent SDS at 37°C. When the 14-roers were used as 
probes, the prehybridization 'for 2 hrs and the hybridization for 15 
hrs were at 37°C in 6X SSC, 0.5 percent NP40, 6 mH EDTA, IX Denhardt 
solution and 50 jig/ml salmon sperm DNA- Several washes were then 

10 performed in 6X SSC at room temperature. 

The extent of hybridization with the 41-mer and the 14-mers 
2A-D was evaluated by washing the hybridized "dot-blot" nitro- 
cellulose filters under increasingly higher stringency. This was 

15 done in order to restrict further the number of phages to be 

considered as potential candidates for further analysis. Twelve 
phages were selected for sequence determination on the basis of this 
evaluation. These phage DMAs were digested with BamHI, Hindlll or 
the combination of both enzymes and the fragments were separated on 

20 agarose gel . 

Southern analysis [44) of the phage DMAs, which hybridized 
with the 41-mer or the 14-mer pools 2A-D showed that the sequences 
hybridizing to either of the probes were localized within a same DMA 
25 segment. The hybridizing BamHI or Hindi 1 1 fragment of each phage 
DNA was subsequently subcloned into plasmid pBR322. This chimeric 
plasmids were in turn cleaved with the endonucleases Sau3AI , Rsal , 
or both. The mixture of fragments was separated on polyacryl amide 
and agarose gels and transferred onto nitrocellulose filters. 
Hybridization with the 41-mer permitted the identification of a 

30 

hybridizing small fragment for all 12 plasmids. These fragments 
were subsequently subcloned into M13 mp8 or mp9 (45) and their 
- nucleotide sequence was determined by the dideoxynucleotide chain 
termination method (46). 
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One of the plasmids, designated pTGF15-l, revealed the 
sequence coding for the first 33 amino acids of TGF- a , located 
within a 180 base pair Sau3AI fragment (Figure 2). pTGF15-l 
contains a 10.2 kilobasepair BarcHI fragment, derived from the 

5 recombinant phage a15. The codon for the 33rd amino acid is 

followed by a stop codon. The GT-di nucleotide at that posiion marks 
the donor site of an intervening sequence £47}. Restriction mapping 
of pTGF15-l and further sequence analysis (data not shown) showed 
that the Sau3AI fragment is located on a 670 bp Sad -Ball fragment 

10 and that the 5au3 AI site downstream of the splice donor site is also 
the recognition site for Bgll l enzyme. 

The nucleotide sequence of the 180 bp Sau3 A-f ragment of 
plasmid pTGF15-l, containing the exon coding for the first 33 amino 

15 acids of human TGF-a and the deduced amino acid sequence are shown 
in Figure 2. The sequence in capital letters is part of the TGF-a 
polypeptide, while the small letter type shows the amino acid 
sequence preceding TGF-a in the precursor. The arrows indicate the 
acceptor and donor site of the intervening sequences as determined 

20 by comparison with the cDNA sequence (see below}. 

Close examination of this nucleotide sequence shows that 33 
residues of the hybridizing 41-mer are homologous with the obtained 
TGF-a DMA sequence. Fourteen homologous bases are in a continuous 

25 stretch. It is not clear why the 48-mer did not hybridize 

significantly since 37 of the 48 residues are homologous. The 
perfect homology with one of the 14-mers of pool 2D results in a 
clear, although surprisingly weak, hybridization. The lack of 
hybridization with the 14-mers 1A-D is due to the presence of a 

30 codon for aspartic acid at position 7 in the mature TGF- a , instead 
of the lysine which was originally predicted. This difference 
results in the presence of two mismatching residues in these 14-mers. 

The isolated 180 bp Sau3AI fragment of pTGF15-l was 
35 subsequently used as a hybridization probe in a dot blot analysis 
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£43] of the previously isolated 35 recombinant phages. Five of the 
35 phage DNAs hybridized with this fragment. Southern analysis 
shows that they all contain the same hybridizing 670 bp Sari -Ball 
fragment. 

2. ' A TGF-g mRNA of about 5000 nucleotides long 

The above described results show that the genomic sequence 
coding for human TGF-a is interrupted by an intervening sequence. 
In order to obtain the full size coding sequence, the genomic 
Sad -Ball fragment containing the TGF-a exon was used to probe cDNA 
libraries derived from mRNA from the melanoma line A2058. Since 
these efforts were again unsuccessful, it was decided to search for 
another cell line as a source of the TGF-a mRNA. 

A collection of different mRNAs, extracted from a wide 
variety of tumor cell lines, was examined by electrophoresis on 
formaldehyde agarose gels {41} and "Northern" hybridization (42) 
with the TGF-a specific Sari -Ball fragment {data not shown). All 
cell lines showed a weakly, probably non-specifically hybridizing 
band at the position of the 28S ribosomal RNA, still present in 
these oligo dT-cellulose selected mRNA preparations. One cell line 
1072 F57, derived from a renal cell carcinoma, showed a clearly 
stronger hybridization signal at the 28S position, indicating the 
presence of TGF-a mRNA of about 4800-5000 nucleotides long. The 
related polypeptide EGF is also encoded by an mRNA of about 4800 
nucleotides long (48, 49). The primary translation product of this 
mRNA is an EGF precursor polypeptide, which is subsequently 
processed into several peptides, one of which is EGF. 

3. Isolation of a cDKIA coding for TGF-g 

. In ..order to jso.la.te .a cDNA containing. the complete sequence 
coding for human TGF- a , RNA was isolated from the above cell line. 
The polyadenylated mRNA fraction was isolated by absorption to oligo 
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dT-cellulose chromatography (50). cDNA was prepared by conventional 
methods {51-53), tailed with dC- homo polymers (54) and annealed into 
the PstI -linearized and dG-tailed pBR322 (55). Transformation (56) 
in £■ co1 * 294 (57) was performed using the high efficiency method 
5 of Hanahan (58) and gave rise to three cDtIA libraries. One library 
contained cDMAs which were primed with dT 1218 , while specifically 
primed cDNA synthesis from the synthetic 16-mer dCATG CTG GCTTGTCC T 
was utilized to prepare the two other libraries. This oligonucleo- 
tide is complementary to the downstream region (nucleotides 134 to 

10 149) of the TGF-a exon, contained within the 180 bp long Sau3AI 

restriction fragment of pTGF15-l (Figure 2). The bacterial colonies 
were screened (59) using the radioactively labeled TGF-a specific 
Sacl-Ball restriction fragment prepared from plasmid pTGFl5-l. Only 
one in 90,000 recombinant E. coli clones which were prepared by 

15 specifically primed cDNA synthesis, hybridized -with the probe. 
Restriction enzyme analysis of this plasmid, called pTGF-Cl, 
revealed the presence of three PstI fragments, the shortest of which 
is the 67 bp fragment also present in the TGF-a exon of pTGF15-l 
(Fig. 2). The three PstI fragments, which represent the cDNA insert 

20 of about 900 bp, were subsequently subcloned into M13mp8 (45) and 
sequenced with dideoxy chain termination method (45). The cDNA 
sequence of plasmid pTGF-Cl with its deduced amino acid sequence is 
shown in Fig. 3. 

25 Figure 3 shows the nucleotide sequence and deduced amino 

acid sequence of the cDNA contained in plasmid pTGF-Cl . The G-C 
tails flank the cDNA at both sides. The nucleotides are numbered 
beneath each line. Numbers above each line refer to the amino acid 
position, assuming that the single methionine constitutes the 

30 NH 2 -terminus. The amino acid sequence for TGF-a is boxed 
and bounded at both sides with an Ala-Val rich sequence 
(overlined residues). 

35 Examination of the nucleotide sequence shows that the cDNA 



2845Y 



- 35 - 



synthesis did not initiate at the RNA sequence specified by the 
primer (position 134-149 of the TGF-a exon, Fig. 2), but rather 
downstream of the position corresponding to the specific 
oglionucleotide. A gene fragment which is immediately downstream of 
the 3' end of this cDNA was subsequently isolated from a recombinant 
phage and did not reveal a. sequence which resembles the specific 
primer. It can therefore be assumed that a random cDNA initiation 
event generated this TGF-a cDNA. It is possible that the secondary 
structure of the TGF-a mRHA precluded specific hybridization with 

10 the oligonucleotide and therefore the specific cDNA initiation. 
Alignment of the sequences of the cDNA (Fig. 3) and the genomic 
fragment from pTGF15-l (Fig. 2) indicates the presence of a splice 
acceptor and donor site (47) in the genomic DNA. The TGF-a exon 
contained with the 180 bp SauA I fragment is thus 121 bp long (Figure 

15 2). 

The recognizable amino acid sequence for TGF-a establishes 
the reading frame in the cDNA sequence. The coding sequence ends at 
nucleotide 527 with a TGA as stop codon and is followed by part of 

20 the 3' untranslated region. The open reading frame continues up to 
the 5' end of the cDNA. It is thus possible that only a part of the 
sequence, coding for the TGF-a precursor, is located on this cDNA, 
especially since the mRNA is about 4800-5000 nucleotides long. It 
is, however, significant that the ATG codon for the single 

25 methionine residue is preceded by an A at the -3 position and is 

immediately followed by a G residue. These features are 
characteristic of the initiation codons in most mRNAs of higher 
eukaryotes (60). In addition, the sequence between positions 8 and 
18 is very characteristic for a hydrophobic core (61) of a signal 

30 peptide involved in protein secretion from the cells. Comparison 

with other signal sequences (61-62) suggests that the cleavage by 
the signal peptidase could occur following the Ala at position 19, 
Cys at position 20 or the Ala at position 22, if this sequence 
Indeed- rep resents the signal peptide. However, the assignment of 

35 this single methionine as the start of the TGF-a precursor implies 
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that the 3' untranslated sequence of the mRNA would probably have an 
unusually large length of about 4,000 nucleotides. 

The cDNA sequence, shown in Figure 3, reveals the complete 
5 DNA sequence for human TGF- a embedded in a larger coding sequence 
for the precursor protein. Direct amino acid analysis of the rat, 
mouse and human TGF- a (27, 35) has revealed the Val-Val sequence at 
the NH^terminus. Based on the polypeptide length of 50 amino 
acids and on the sequenced carboxyl terminus of the rate and mouse 

10 TGF-a ends with the Leu-Ala residues at positions 88 and 89 (Fig, 
3). In order to generate the 50 amino acid long TGF-a, proteolytic 
cleavage must occur at both the amino- and carboxyl -termini between 
alanine and valine residues. This Ala-Val dimer at the 
NH^terminus, which is located within the sequence 

!5 Yal-Ala-Ala-Ala-Yal-Val, is very similar to the Ala-Yal-Val -Ala-Ala 
sequence found at the carboxyl end. A protease with this remarkable 
specificity and which could thus be responsible for the proteolytic 
processing of the TGF-« precursor has not yet been described. 

20 The complete sequence coding for TGF- a has now been 

determined in the cDNA derived from a renal cell carcinoma and in a 
gene fragment isolated from a genomic library which was derived from 
a normal fetal liver (40). Both sequences are identical, indicating 
that there are no coding differences between the TGF-a genes in both 

25 sources. 

Alternatively, having established the DNA coding sequence 
for TGF-a (See Figure 3), the gene can be synthesized using 
conventional methods, such as those described in references (38, 39). 

30 

The deduced amino acid sequence for the precursor of TGF-a 
reveals a very hydrophobic region beginning at 20 amino acids 
downstream of the carboxyl terminus of 50 amino acid TGF-a. These 
residues 103 to 121 consist almost exclusively of leucines, 
35 isoleucines and valines. The sequence from amino acid 118 to the 
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carboxyl terminus of the precursor polypeptide is remarkably rich in 
cysteines. This sequence of 42 amino acids contains 8 cysteines, 4 
of which are clustered 1n pairs. This cysteine-rich sequence could 
possibly constitute a biologically active polypeptide. It can as 
5 yet only be speculated how this peptide might be cleaved from the 

precursor molecule. Several polypeptide hormones are synthesized as 
larger precursors and are usually bounded by pairs of basic amino 
acids. The Lys-Lys residues at positions 96-97 could possibly be 
the site of proteolytic cleavage as in the case of preproenkephalin 
qo (63-64), the calcitonin precursor (65) and the corticotropin-B- 
lipotropin precursor (66). 

Gel filtration analysis suggests the existence of some 
larger TGFs-a with estimated molecular weights of 10 to 23 

15 kilodaltons {1, 5, 22, 28, 29, 30, 33, 34). The nature of these 

larger TGFs-a is unknown. It is possible that several related genes 
coding for TGFs-a are present in the genome. However, Southern 
hybridizations (44) of total human genomic DNA with the 180 bp 
Sau3AI and the 670 bp Sad -Ball fragment of pTGF15-l did not reveal 

20 the presence of multiple genes. Alternatively, the nature of some 
of these larger TGFs might be explained by different types of post 
translational processing of the precursor molecules. It is also 
possible that aggregation of the TGFs-a with some other proteins 
could result in an apparent larger molecular weight. 

25 

4. Expression of TGF-g in E. coli 

TGF-a is made in minute amounts by many tumor cells. 
Indeed, only 1.5 y g of the small TGF-a species has been isolated 
^ from 136 liters of culture supernatant of the melanoma cell line 
A2058, which is considered an "overproducer" of TGF- a (34). This 
very low availability of the TGF-a from cell culture has hampered 
its biological characterization. In order to facilitate these 
studies the synthesis of human TGF- a in E_. coli was pursued. 

35 
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As the sequence coding for TGF-q is embedded in a 
precursor, we introduced a start codon in front and a stop codon 
behind the coding sequence- The start codin is preceded by an EcoR I 
recognition site and the stop codon is followed by a Bglll site, so 
that the TGF-a sequence becomes available as a portable EcoR I -Bgll l 
restriction fragment (Fig. 4). 



TGF-a was expressed as part of different fusion proteins in 
E_. col i in a way similar to human somotostatin (72), insulin (73) 

10 and desacetylthymosin-al (74). In these latter cases the mature 
protein can be cleaved from the fusion polypeptide using cyanogen 
bromide. This chemical treatment results in the specific cleavage 
behind the methionine residue (72), which was introduced to connect 
the front part of the fusion protein with the mature polypeptide. 

15 t wo piasmids were designed so that the TGF-a coding sequence and its 
preceding methionine codon are linked at the EcoR I site to the 
sequence for the front part of a trp leader t- trp E fusion protein 
(trp aLE 1413, ref- 76). The expression of this fusion protein is 
under the control of the trp promoter using the trp leader ribosome 

20 binding sequence. In the case of the expression plasmid pTE6, the 
sequence for the first 190 amino acids of this trp aLE 1413 fusion 
protein, including several cysteine residues, is linked in frame 
with the TGF-a sequence and followed by a stop codon. In plasmid 
pTE5 the sequence coding for only the first 17 amino acids of this 

25 trp LE fusion protein precedes the TGF-a DMA sequence. This stretch 
of 17 amino acids does not contain any cysteines, so that the 
presence of this NHg-terminus would probably not affect the 
disulfide bond formation of the expressed protein (Fig. 5a). In 
both cases cleavage with cyanogen bromide will release mature TGF-a 

30 because of the presence of a methionine codon in front of the TGF-a 
coding sequence. 



Figure 4 shows a schematic representation of how several 
piasmids for expression of TGF-a (amino acids 40-89, Fig. 3), with 
35 or without its downstream sequence (amino acids 90-160, Fig. 3) were 
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constructed. Restriction mapping indicated that in pTGF15-l, the 
180 bp Sau3AI -Bgll l fragment which carries the TGF-a exon is 
contained within a 380 bp PvuII-Smal fragment. This latter fragment 
was Isolated, denatured and renatured in the presence of the 
5 synthetic oligonucleotide dCATGGTGGTGTCCCATTTT, which was B'-labeled 
using r 32 P-ATP and T4" kinase. _E. coli DNA polymerase I Klenow 
fragment was added to the mixture to catalyze the repair synthesis 
as described (67). Using this primer repair technique the CATS 
sequence was introduced in front of the coding sequence for TGF-a* 

10 The reaction products were then cut with Bgll l and the 130 bp 
fragment containing the partial TGF-a sequence was isolated by 
polyacryl amide gel electrophoresis. Plasmid pYG121, which is 
essentially identical to the expression plasmid pIFN-eZ (pBoIFN-p2) 
(77) except that the bovine IFN-@2 DNA sequence is replaced by a 

15 short synthetic DNA fragment, was opened at its unique EcoRI site, 
filled in with E. coli DNA polymerase I (Klenow fragment), and cut 
with Bglll . The 130 bp TGF-a fragment was ligated into this vector, 
thus restoring the EcoR I site. The resulting plasmid is called pTEl. 

20 The 130 bp EcoRI -Bglll restriction fragment which contains 

the sequence coding for the first 33 amino acids was isolated from 
pTEl and ligated to the 350 bp Bglll -BamHI fragment of pFIFtrp 3 69 
(67), which contains the front part of the tetracycline resistance 
. gene, and to the large vector fragment of the EcoR I and BamH I cut 

25 pINCV-PA13-33 (i.e., pINCV (78), containing a plasminogen activator 
cDNA insert). The resulting plasmid pTE2 contains the sequence for 
the first 33 amino acids of TGF-a linked via an EcoR I site to the 
sequence coding for the NH 2 -terminal 17 amino acids of the 
aforementioned trp ALE 1413 (ref . T 76) fusion protein. 

30 

The 130 bp RI -Bgll l fragment containing the TGF-a sequence 
was also linked to the sequence coding for the first 190 amino acids 
of the aforementioned trp LE fusion polypeptide. This plasmid, 
pTE3, was constructed by ligation of the 130 bp RI -Bglll fragment of 
35 pTEl to the Bglll -Bam fragment of pFIFtrp369 and the large EcoRI 
and BamHI vector fragment of pNCV (51). 
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The plasmids for the expression of the complete TGF- a 
sequence as a short fusion was obtained as follows. pTE2 was cut at 
its unique Pstl and BglHI site and the larger fragment was 
isolated. The cDHA plasmid pTGF-Cl was cleaved with Pst! fposftion 

5 223, Fig. 3) and Avail (position 301, Fig. 3) and the 78 bp TGF-a 
specific fragment was isolated. Both fragments were ligated in the 
presence of the partially complementary oligonucleotides 
dGACCTCCTGGCCTAA and dGATGTTAGGCCAGGAG. These oligomers introduce a 
stop behind the TGF-a coding sequence and connect the Avai l site 

10 with the BgTJ I site, The resulting plasmid is pTE5. Plasmid pTE6 
has the complete coding sequence for the first 190 amino acids of 
the trp LE fusion. It was made by ligation of the TGF-a specific 
520 bp long EcoRI -BamHI fragment into the large EcoRI -BamHI fragment 
of pNCV (51). The trp promoter controls the synthesis of the TGF-a 

15 short and long fusion proteins from plasmids pTE5 and pTE6, and the 
expression of the Tet R gene. 

For the direct expression of the DNA sequence (Fig. 4d) for 
TGF-a, with its connecting downstream coding sequence, plasmid 

20 XAP-PA2 was cleaved at the unique Xbal and BgTII sites and the large 
vector fragment was isolated. XAP-PA2 is a plasmid essentially 
similar to pFIF-trp69 (67) except that the IFN-e cDHA sequence is 
replaced by the cDNA sequence for human tissue plasminogen 
activator, modified as in plasmid pt-PA trpl2 [70}, and that the 641 

25 bp Aval-PvuII fragment downstream of the Tet R gene was deleted. 
In addition, the 70 bp Xbal and Bgll l fragment was isolated 
containing the start of the TGF-a sequence from pTEl and the 345 bp 
PstI -Sau3AI fragment which contains the rest of the coding sequence 
from pTGF-Cl. The three fragments were ligated. The resulting 

30 plasmid, pTE4, contains the coding sequence preceded by a start 
codon, as an EcoRI -Bglll fragment under the control of the trp 
promoter. 

In addition, to the plasmid pTE4 for direct expression, we 
35 also envisaged the expression of this coding sequence as a fusion 
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protein with the NH 2 -temrfnal part of the trp leader protein. The 
TGF-a specific Fstl-Bglll fragment of 350 bp long was therefore 
isolated from pTE4 and ligated into the large Pst-BgUI fragment of 
plasmid pTE2. The resulting plasmid is called pTE7. Alternatively, 

5 the TGF-a sequence containing EcoRI -BamHI fragment of pT£4 was 

ligated into the large EcoRI -BanHI fragment of pTE3, giving rise to 
plasmid pTE8. Plasraids pTE7 and pTE8 contain the TGF-a ONA sequence 
and its downstream coding sequence linked to the first 17 or 190 
amino acids, respectively, of the trp LE 1413 fusion protein (ref. 

10 76). The ligation of these sequences is via an EcoRI site which is 
followed by the ATG codon for methionine. The presence of the 
methionine in front of the TGF-a sequence makes it possible to 
cleave the fusion protein specifically behind this residue. 

15 Restriction enzymes were purchased from New England Biolabs 

or Bethesda Research Laboratories. T4 kinase was from New England 
Nuclear, T4 ONA ligase was from Bethesda Research Laboratories, and 
E. coli DNA polymerase I {Klenow fragment) was from New England 
Nuclear or Boehringer Mannheim. All enzymes were used essentially 

20 as recommended by the manufacturers. 

T4 kinase reactions were performed in 70 mM Tris-Cl pK 7.6, 
10 mM MgCl 2 and 5mH dithiothrei tol . Ligations were in 20 mM 
Tris-Cl, pH 7.6, 50 mM NaCl , 6 mM MgClg, 10 mM dithiothrei tol , 0.5 
25 mM ATP. Restriction digestions were in 6 mM Tris-Hcl, 6 mM HgClg, 
6 mM e-mercaptoethanol , while the "fill-in" reactions using 
DNA-polymerase I (Klenow fragment) were done in the same buffer 
supplemented with 20 y M of each of the 4 dNTPs. 



Fig. 5 fives a schematic representation of plasmids pTE5 
and pTE6 and shows the amino acid sequence connecting TGF- a with the 
preceding trp LE fusion protein sequence. 

The introduction of these plasmids pTE5 and pTE6 into 
E. coli H3110 and the subsequent induction of the trp promoter 
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results in the synthesis of high levels of the TGF-a fusion 
proteins. The long fusion protein, expressed from plasmid pTE6, 
constitutes the major protein by far in a total bacterial lysate. 
The short fusion protein encoded by plasmid pTE5 is not synthesized 
5 in such high abundance but is nevertheless easily detectable as a 
prominent band in the E. coli lysate (Fig. 6). 

Fig. 6 shows the results after electrophoresis in a SDS-13 
percent polyacryl amide gel (79) of total lysates of E. coli 

10 containing the expression plasmids pTE2, pTE3, pTE5 or pTE6. 

E. cpli W311Q, transformed with these plasmids was grown at 37°C in 
M9 medium containing casamino acids and tetracycline (5 tig/ml ) to 
0D,j[jq= o.l. Expression from the trp promoter was boosted by 
adding indolacetic acid to 20 wg/ml . Three ml of bacteria were 

15 harvested at 0D 55Q = 0.1 before induction, while a similar number 
of bacteria were collected at OD^^ 1-0 after induction in the 
case of pTE2 and pTE5 or 0D 55Q = 0.7 for pTE3 and pTE6. The 
pelleted bacteria were resuspended in 30 til 10 mM Tris-KCl pH 7.5, 1 
mM EDTA and 3 nl 1 M e-mercaptoethnol and 6 nl 10 percent SDS were 

20 added. The mixtures were heated for 2 min at 95°C and 300 nl cold 
acetone was added. The acetone precipitate was pelleted, dissolved 
in 25 bl SDS-loading buffer {5 percent B-mercaptoethanol, 4 percent 
SDS, 0.125 M Tris-HCl, pH 6.8, 20 percent glycerol) and after 
heating (2 rain. 95"C) loaded on an SDS-13 percent polyacryl amide 

25 gel. The gel was stained with Coomassie Brilliant Blue. Bacterial 
lysates before and after induction are shown. The arrow marks the 
TGF-a fusion protein. The position of the molecular weight markers 
is shown at the right of Figure 6. 

30 The short TGF-a fusion proteins from E_. coli transformed 

with pTE5 was substantially enriched in order to determine the 
bio :ogical activity. Two procedures, not involving column 
chromatography, resulted in "a purity of 80 to 90 percent. One 
method is based on the observation that the short TGF-a fusion 

35 protein is, in contrast to many other proteins, apparently insoluble 
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in the presence of 0.4 M NaCl and 0.5 percent NP40. The precipitate 
can then be solubilized in 8 H urea and dialyzed to 1 M acetic 
add. The soluble fraction contains mostly the TGF-a fusion 
protein. Alternatively, the bacteria were sonicated in 70 percent 
5 ethanol acidified withHCl, and the TGF- a fusion protein was 
precipitated from the cleared supernatant by an ether-ethanol 
precipitation and dissolved in 1 H acetic acid essentially as 
described (18). Gel electrophoretic analysis showed that both 
procedures resulted in an equally efficient enrichment (Fig. 7). 

10 

Figure 7 shows an SDS-polyacryl amide gel (79) of the 
bacterial short TGF-a fusion protein, enriched by the acid-ethanol 
method. The 68 amino acid long TGF-a fusion protein migrates as a 
broad band (arrow) in this gel. The enrichment of the protein by 
1S the NP40-NaCl method is very similar. 

The bacterial TGF-« short fusion protein thus obtained was 
tested in two different assays. It has been shown that natural 
TGF-a competes with EGF for the same receptor (5, 8, 9). This has 

20 led to the development of a fast and quantitative binding assay for 
TGF-a, based on the competition with 125 I -labelled EGF (5, 14, 
27). The results from binding experiments using NRK cells (15, 19, 
27) indicate unambiguously that the TGF-a short fusion protein, as 
well as the TGF-a generated by cyanogen bromide cleavage (not shown) 

25 from the same fusion protein, bind to the EGF-receptor. However, 
the binding of the short TGF-a fusion protein is only 0.5 - 1 
percent of the expected value, if one assumes quantitatively 
equivalent binding of TGF- 0 and EGF for the same receptor under the 
experimental conditions. It is possible that the low value may be 

3 q due to an intrinsically lower binding affinity of human TGF-«, or to 
the presence of molecules with aberrant configurations or to the use 
of binding conditions which are not optimal for the bacterial TGF-a. 

The A panel of Figure 8 snows "the competition of the 

3S bacterial TGF-a short fusion with 125 I -label led EGF in a 
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radioreceptor assay (solid line). The calibration curve with EGF is 
shown as a dashed line. Panel B shows the EGF calibration curve 
separately. 

5 The biological activity of TGF-a can be measured by its 

ability to induce anchorage independence of non-transformed cells, 
such as NRK cells. The presence of TGF-a or EGF induces the 
formation of colonies. The number and size of these colonies is 
strongly increased in the presence of TGF-b (13-14). The bacterial 

10 TGF-a, purified as indicated before, was tested for the ability to 
induce anchorage independence, using NRK cells (140). Figure 9 also 
shows soft-agar colony-forming activity of murine EGF and the 
bacterial TGF-a fusion protein before and after cleavage with 
cyanogen bromide (75). The assay was performed in the presence of 

15 TGF-b with NRK cells, clone 49F, as described (14). The ordinate 
scores the number of colonies larger than 850 nm 2 , while the 
abscissa indicates the concentrations of EGF or bacterial TGF-a, 
expressed in EGF equivalents (ng/ml), as determined in the EGF 
receptor binding assay. The dashed line shows the curve for EGF, 

20 while the solid lines give the results for the bacterial TGF-a 
fusion protein, before (o) or after (o) cleavage with cyanogen 
bromide. These results clearly show that the TGF-a short fusion 
triggers colony formation in soft agar in the presence of TGF-b. It 
is remarkable that in these assays the bacterial TGF-a is about 20 

25 to 30 times more active relative to EGF than it is in the 

radioreceptor assay. This quantitative difference was also apparent 
with the cyanogen bromide cleaved TGF-a fusion protein. The number 
and the size of colonies, induced by the bacterial TGF-a, in the 
absence of TGF-p, is much smaller than in the presence of TGF-b, but 

30 also under these assay conditions the quantitative differences 
between the bacterial TGF-a and EGF remain unchanged. 

The results from both types of assays show that the 
bacterial TGF-a fusion protein, with its additional 17 
35 NH 2 -terminal amino acids, and the cyanogen bromide cleaved protein 
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compete with EGF and can induce anchorage independence of NRK 
cells. The ability to induce colony formation in soft agar is much 
higher than expected on the basis of the radioreceptor assay. In 
all cases TGF-a has 20 to 30 fold higher activity in soft agar than 
5 . it has in the radioreceptor assay (relative to EGF). 

Hassague et aj_. (12) have presented evidence that TGF-a 
binds not only to the EGF receptor but also to a 60 kd TGF-a 
specific receptor. It is possible that the binding to this latter 

10 receptor medites in part the induction of the anchorage-dependent 
character by TGF- a . If so, the EGF radioreceptor assay may not be 
predictive in an absolute manner for the colony formation in soft 
agar. It could also be possible that the binding characteristics of 
TGF-a to either of these receptors are not the same. In 

15 contradiction to this, Carpenter et aK (8) have recently shown that 
the induction of anchorage independence by TGF-a can be blocked by 
the presence of antisera to the. EGF receptor, indicating that the 
binding to this receptor is required for the appearance of colonies 
in soft agar. It is also possible that the binding of the bacterial 

20 TGF-a to the EGF receptor is more efficient under conditions for the 
soft agar assay than during the radioreceptor assay, and that this 
may explain the quantitative differences observed between both 
assays. 

25 5 - Expression of TGF-g in Yeast 

A plasmid was designed which was aimed at the expression of 
TGF-a in yeast and subsequent secretion into the yeast medium. For 
this purpose we explored the use of the gene coding for the mating 
facto r-a in yeast. This a-f actor is secreted from the yeast S. 

3 q cerevisiae and plays an important physiological role in the mating 

process. The gene for this a-factor has been isolated and codes for 
a large precursor. The p rep ro-a-f actor polypeptide comprises an 
amino- terminal signal peptide needed during the secretion process, 
and 4 identical a-factor peptide units. Release of these a-factor 

35 peptides most likely involves a proteolytic cleavage of the 
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precursor at the dibasic lys-Arg residues, located downstream of the 
signal peptide and in front of the a-factor units (J- Kurjan et al_-, 
1982, "Cell" 30: 933; Singh et a]... 1983, "Nucl . Acids Res." Tl_: 
4049. A T6F-a expression plasid (pyTE2) was constructed in which 
the sequence coding for the 50 amino acid TGF-a and its flanking 
stop codon was introduced in frame immediately following the codons 
for the yeast a-factor Lys-Arg dipeptide. The sequence for the 
amino terminal part of the prepro-a-f actor including the signal 
peptide is retained. The expression of this TGF-a fusion 
polypeptide is under the control of the a-factor promoter. Suitable 
starting plasmids for this and equivalent constructions are 
described in EP 123.544A. Expression plasmid pyTE2 (Fig. 10), also 
contains a functional yeast replication origin derived from the 2v 
plasmid (<J. Hartley et alk, 1980, "Nature" 286: 860), a 
transcriptional terminator and polyadenylation site from the "Able* 
gene (J. Hartley et a]_., Id.) and the TRP-1 selection marker (G. 
Tschumper et aT_. , 1980 "Gene" Jj): 157-166). 

Saccharomyces cerevisiae strain 20B-112 (E. Jones, 1976, 
"Genetics" 85: 23) was transformed with pyTE2. The medium of the 
transformed yeast was assayed both 1n the radioreceptor assay and in 
the soft agar colony formation. Using both assays, biologically 
active TGF-a could be detected in the yeast medium. Further 
analysis showed that about 8 ng of TGF-a can be recovered per ml of 
medium and that following the synthesis in yeast more than 90 
percent of the biologically active TGF-a is secreted. It is likely 
that the secreted TGF-a has the proper disulfide bond configuration, 
since refolding the disulfide bridges is not needed for activity. 
Using a similar a-factor based expression vector, Brake et aTN, 
1984, n Proc. Natl. Acad. Sci. USA" EH: 4642, have recently been able 
to express and secrete human EGF from yeast. 

6. Expression of a TGF-gC Polypeptide 

pTE4 was digested with Bgll and BamHI and the 
TGF-aC-containing fragment recovered. An oligonucleotide primer 
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having the sequence 

MetnelleThrCysValLeu 
G ATC G* AATTCATG ATC ATCAC ATG TG TGC TG 

EcoRI 

was prepared, representing an EcoRI site, a methionine codon and the 
first six residues of a TGF-aC polypeptide starting at lie 115 of 
the precursor. The regions in the TGF-aC-contaim'ng fragment which 
were 5' to this primer were deleted by primer extension following 
conventional procedures. The DNA encoding the TGF-aC polypeptide 
was recovered by gel electrophoresis of an EcoRI and Bgll l digest. 
Alternatively, this DNA could be prepared by organic synthesis. 

pTE5 was digested with EcoRI and Bglll, and the large 
vector fragment recovered. The recovered pTE5 fragment and the 
EcoRI -Bglll TGF-aC fragment from the previous steps were ligated 
with T4 ligase, the mixture used to transform E. coli and the 
bacteria cultured as described above. TGF-aC was relatively toxic 
t0 E. coli . Better yields might be obtained by modifying the 
oligonucleotide so that the E. coli STII or alkaline phosphatase 
signals are inserted in place of the ATG start codon, or by 
expressing the TGF-aC gene in a mammalian cell transformation 
host-vector system. 
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CLAIMS 

1. A bandage impregnated with a product comprising human 
mature TGF-a, TGF-aC f TGF-aN or TGF-amC essentially free of 

5 other human proteins. 

2. A composition comprising precursor TGF-a or a fragment 
thereof produced by heterologous recombinant host cells. 

10 3. A composition of claim 2 wherein the precursor of 
TGF-a or its fragment is a mutant. 

4. A composition of claim 2 conjugated to a heterologous 
polypeptide which is heterologous to the organism from which 

15 was obtained the recombinant DNA encoding the precursor 

TGF-a or its fragment, the heterologous polypeptide being a 
microbial signal sequence. 

5. A composition of claim 2 wherein the precursor TGF-a 
2 0 or its fragment is human and additionally comprising a 

predetermined amount of a human protein. 

6. A composition of claim 2 which is labelled with a 
detectable moiety. 

25 

7. A composition of claim 2 which further comprises a 
pharmaceutically acceptable carrier and a therapeutically 
effective amount of precursor TGF-a or its fragment. 

30 8. A composition of claim 7 which is sterile and which 
further comprises a predetermined amount of TGF-B . 

9. A composition of claim 8 for topical application to a 
wound . ■ • - - — •- 

35 
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10. A DNA sequence encoding precursor TGF-a or a fragment 
thereof, provided that when the fragment is mature TGF-a 
then the mature TGF-a is human. 

5 11. The DNA sequence of claim 10 ligated to DNA encoding a 
heterologous polypeptide. 

12. A replicable vector comprising the DNA sequence of 
claim 10 or claim 11. 

10 

13. A cell transformed with the vector of claim 12. 



14. A method for producing a polypeptide comprising TGF-a 
precursor or a fragment thereof, comprising: 

15 (a) preparing an expression vector capable of acting 

in concert with a host cell to express a DNA sequence 
encoding said polypeptide; 

(b) transforming said host cell with the expression 
vector to obtain a recombinant host cell; 
20 (c) culturing the recombinant host cell under condi- 

tions for expression of the product; and 

(d) recovering the product from the host cell culture. 

15. A diagnostic method comprising assaying a body tissue 
25 or fluid for the presence of TGF-amC or TGF-aC. 



16. A diagnostic method. for determining TGF-amC 
comprising: 

(a) contacting a body fluid sample with an antibody 
30 capable of binding the TGF-aC domain of TGF-amC ; 

(b) contacting the body fluid sample with an antibody 
capable of binding the mature TGF-a domain of TGF-amC; 

(c) labelling the antibody of parts (a) or (b) with a 
detectable moiety; 

35 (d> insolubilizing the antibody which is not labelled; 
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(e) separating the soluble and insoluble phases; and 

(f) measuring the distribution of label between the 
phases . 

5 17. An antibody raised against a predetermined amino acid 
sequence within a PRTGF-a species. 

18. An antibody according to claim 17 for administration 
to an animal. 

10 

19. A sterile therapeutic composition comprising a 
pharmaceutically acceptable carrier and a therapeutically 
effective dose of a TGF-o binding agent for inhibiting the 
activation of cell surface receptors by TGF-o. 
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